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ABSTRACT 

We analyse catalogues of galaxies from the Sloan Digital Sky Survey (SDSS) and find 
a weak but significant assembly-type bias, where old galaxies have a higher clustering 
amplitude at scales r > 1 /i~^Mpc than young galaxies of equal magnitude. When 
using the definition of age based on the luminosity-weighted stellar age, the difference 
(-H \ increases from -f9 < Mr < -18 to -22 < Mr < -21 SDSS galaxies, from a 25 

per cent to a 45 per cent effect. In contrast, when using the definition based on the 
D„4000 index, the clustering amplitude at large scales shows a weaker signal-to-noise 
for the assembly bias detection. In semi-analytic galaxies the difference in clustering, 
using the stellar age and luminosity, is similar at distances beyond 3 /i~^Mpc, but 
they overpredict the assembly bias found in SDSS galaxies at smaller scales. We then 
adapt the model presented by Lacerna & Padilla (Paper I) to redefine the overdensity 
peak height, that traces the assembly bias such that galaxies in equal density peaks 
show the same clustering regardless of their age, but this time using observational 
features. The proxy of peak height for observational data consists in the luminosity 
inside cylinders of different radii centred in each galaxy. This radius is parameterized 
as a funcion of stellar age and magnitude Mr- The best-fitting set of parameters that 
make the assembly bias practically absent for both simulated and real galaxies are 
similar (a — 0.75, h — —0.36 and a — 1, h = —0.36, respectively). By estimating a 
weighted average for the values in the range of la for semi-analytic and SDSS galaxies, 
it is obtained that the set of best-fitting parameters for the radius of the cylinder is 
a = 0.76 ± 0.31, b = —0.39 ± 0.18. The idea behind the parametrization is not to 
minimize the bias, but it is to use this method to understand the physical features 
that produce the assembly bias effect. Even though in this case we consider neighbour 
galaxies in cylinders in contrast to using a smooth density field, e.g. the dark matter 
particles distribution, our simulations show that the new method is correlated to the 
definition of Paper I as both measure the crowding around objects. 
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1 INTRODUCTION 

In the current hierarchical clustering models, galaxies 
form by gas cooling inside virialized cold dark mat- 
ter haloes in a way that makes their properties depend 
only on the halo mass in which they form. Recently, re- 
sults from A'^-body simulations have shown that the halo 
clustering at large-scales depends on the halo formation 
history in addition to the halo m ass (Cao et al. 2005': 
Wechsler et al. 20061: [Zhu et al. 20061 : [Cao & White 2007; 
Faltenbacher fc White 2010l '). This effect, that is not ex- 
pected from the excursion set theory, was termed "assem- 
bly bias". For example, Gao et al. (2005) found that low- 
mass haloes which were assembled at high redshifts are more 



strongly correlated than those of the same mass that assem- 
bled more recently. Therefore, one could ask whether the 
assembly bias could also affect galaxy properties. 

By means of a semi-analytic model of galaxy formation, 
Croton et al. (2007) showed that galaxy clustering on large 
scales is affected by the correlation of halo environment with 
halo assembly history. Lacerna et al. (2011, hereafter Paper 
I) found an assembly bias effect in the semi-analytic galaxies 
from the Lagos, Cora & Padilla (2008) model, where the old 
population has a higher clustering at large scales than the 
young population with the same host halo mass. 

On the observational side, the existence of the assembly 
bias is still debated. For example, Skibba et al. (2006) used 
galaxies from the Sloan Digital Sky Survey (SDSS; York 
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et al. 2000) to measure marked correlation functions which 
in addition to the halo model suggest that the correlation 
between halo formation and environment is not important 
for the bright galaxy population. Tinker, Wetzel & Con- 
roy (2011) found similar results, indicating that this depen- 
dence arises just from the correlation between halo mass 
function and environment. However, Yang, Mo, & van den 
Bosch (2006) and Wang et al. (2008) found that groups, 
selected from the Two Degree Field Galaxy Redshift Sur- 
vey (2dFGRS; CoUess et al. 2001) and SDSS respectively, 
with red central galaxies are more strongly clustered than 
groups of the same mass but with blue central galaxies, this 
effect being much more important for less massive groups. 
Zapata et al. (2009) found that SDSS galaxy groups of sim- 
ilar mass and different assembly histories show differences 
in their fraction of red galaxies. Furthermore, Cooper et al. 
(2010) studied the relationship between the local environ- 
ment and properties of galaxies in the red sequence. After re- 
moving the dependence of the average overdensity on colour 
and stellar mass by setting a density parameter as a estima- 
tor of environment such as it is independent of these galaxy 
properties, they still found that galaxies with older stellar 
populations occupy regions of higher overdensities compared 
to younger galaxies of similar colours or stellar masses. Also, 
Alonso et al. (2012) show that galaxies in groups in low 
global densities are prone to show higher star formation com- 
pared to galaxies of equal properties and similar host groups 
but in high density environments. These results show that 
the concept of assembly bias is applicable to the galaxies 
in addition to dark matter (DM) haloes and would then in- 
fluence the physics of galaxy formation. Also, this effect is 
important when galaxies are used to constrain cosmological 
parameters. Wu et al. (2008) showed that upcoming photo- 
metric catalogues such as the Dark Energy Survey (DESfl 
and the Large Synoptic Survey Telescope (LSST; LSST Sci- 
ence Collaborations et al. 2009j!| can infer significantly bi- 
ased cosmological parameters from the observed clustering 
amplitude of galaxy clusters if the assembly bias is not taken 
into account. 

Paper I presented a new peak height definition differ- 
ent from the halo virial mass, where semi-analytic galaxies 
in haloes of a given mass but different age do not show the 
assembly bias effect in the two-halo regime. This new def- 
inition simply adds mass to that of the dark matter halo 
in a way that makes old, low-mass haloes suffer the most 
extreme peak height change with respect to what results 
of using the virial mass as its proxy. In this paper we will 
test this model using galaxies selected from the SDSS cat- 
alogue with the difference that, instead of using the total 
mass inside spheres of different radii, we will use the total 
luminosity inside cylinders along the position of each galaxy, 
which we will use to make a more adequate proxy for a peak 
height, in a similar way to what was proposed in Paper I. 
We will discuss the implications of using discrete tracers of 
the density field that consider neighbour galaxies instead of 
a smooth density field. 

The outline of this paper is as follows. In Section [2] we 
present the catalogues of galaxies from the SDSS and also 
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from a semi-analytic model. We characterize the dependence 
of age on local density in Section [S] We study the assembly 
bias effect, namely, the large-scale clustering of galaxies of 
a given luminosity depends on the age, for both the SDSS 
and semi-analytic galaxies in Section [l] We adopt a mod- 
ified version of the model from Paper I in Section O but 
using observational features. The nature of the objects that 
are being considered with the proxy of the overdensity peak 
height is shown in Section |S] where we discuss the implica- 
tions of using discrete tracers of the density field that con- 
sider neighbour galaxies instead of a smooth density field. 
The conclusions of our results are presented in Section [T] 



2 DATA 

2.1 SDSS galaxies 

We use galaxies from the spectroscopic SDSS Data Re- 
lease 4 (DR4, Adelman- McCarthy et al. 2006), which 
contains ^^480, 000 galaxies. After cleaning of repeated 
galaxies, saturated objects, galaxies close to the edges 
of the survey, etc., the sample includes ~400,000 galax- 
ies with mea sured spectra and photometry in the bands 
u, g, r, I, z l|Fukugita et al. 19961 : ISmith et al. 20021 ) that 
are Jy-corrected to re dshift 0.1 using kcorrect v4_2 
(jBlanton fc Roweis 20071 ). The main motivation to use these 
data is the measured luminosity-weighted ages by Gallazzi 
et al. (2005), which are essential to this work|f| There are 
183,482 galaxies with estimates of stellar age in the mag- 
nitude range -22 < < -18 (Table [TJ. We then select 
four samples of centres and tracers detailed in Tables [T] and 
(2] respectively. Figure [T] shows the absolute magnitude in 
the r-band, Mr, as a function of redshift with the choice of 
volume-limited samples, which avoid observational bias. 

The SDSS galaxies will be useful to study whether the 
assembly bias effect is present in these observed objects thus 
affecting their properties, as will be shown in Section 3] 



2.1.1 Age parameter St 

As was the case in Paper I, one of the most important pa- 
rameters throughout this work is the age. For SDSS galaxies, 
we use the luminosity-weighted age (or, simply, stellar age) 
and the L»„4000 index (Balogh et al. 1999). The latter is 
defined as 



Ai4000 



(4000 - 4100 A) 
^1(3850 - 3950 A) 



(1) 



where Fi, is the average fiux density in the specified bands 
(in the rest frame), and it is smaller for young stellar pop- 
ulations than for old, metal-rich galaxies. In our samples 
there are 183,482 galaxies with estimates of stellar age (Ta- 
ble [1} and 205,076 with D„4000, both in the magnitude 
range -22 < Mr < -18. 



^ Luminosity-weighted ages are obtained from 

http: / /www. mpa-garching.mpg.de/SDSS/DR4/Data/stelIarmet. html 
which includes the DR4 version of the tables used in Gallazzi et 
al. (2005). 



///. The nature of assembly bias 3 



Table 1. Centre samples used to measure the cross-correlation functions. The second column indicates the number of objects with 
luminosity-weighted ages, the third column corresponds to the magnitude bin, and the last column indicates the redshift range of each 
volume-limited sample (see Fig. [TJ. 



N 



Mr 



redshift 



centres sample 


1 


10 


953 


-19 


< 


Mr 


< 


-18 


0.010 


< 


z 




0.050 


centres sample 


2 


41 


167 


-20 


< 


Mr 


< 


-19 


0.012 


< 


z 




0.078 


centres sample 


3 


78 


632 


-21 


< 


Mr 


< 


-20 


0.020 


< 


z 




0.120 


centres sample 


4 


52 


730 


-22 


< 


Mr 


< 


-21 


0.035 


< 


z 




0.175 



Table 2. Same as Table [T] but for tracers. In this case, the third column corresponds to a magnitude limit. 



N 



Mr 



redshift 



tracers sample 


1 


33 907 


Mr 


< 


-18 


0.010 < 


z 


s£ 0.050 


tracers sample 


2 


77 114 


Mr 


< 


-19 


0.012 < 


z 


s£ 0.078 


tracers sample 


3 


103 135 


Mr 


< 


-20 


0.020 < 


z 


s; 0.120 


tracers sample 


4 


60 180 


Mr 
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-21 


0.035 < 


z 


s£ 0.175 
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Figure 1. Absolute magnitude Mr as a function of redshift for 
SDSS galaxies. Four volume-limited samples of centres and tracers 
are chosen to perform cross-correlation functions without obser- 
vational bias. Objects inside the rectangles in red, blue, green, 
and cyan correspond to the centre samples 1, 2, 3, and 4 in Ta- 
ble [T] For each sample, the tracers (see Table \2\ are in the same 
range of redshift (delimited by the vertical lines) as centres but 
also include brighter objects. 



Figure 2. Stellar age as a function of the absolute magnitude 
Mr for SDSS galaxies, which correspond to the volume-limited 
samples described in TablelTI Due to the large number of available 
galaxies, only 10,000 of them, randomly chosen, are plotted as 
black points. Red squares show the medians for each magnitude 
bin. Error bars correspond to the 10 and 90 percentiles of the 
stellar age distribution. The median population of fainter galaxies 
is younger than that of the brighter objects. 



To study the assembly bias in galaxies, where old ob- 
jects may have a different clustering than young objects of 
the same mass or luminosity, it is not convenient to work di- 
rectly with the stellar age, for example, because this quantity 
correlates with the mass or the luminosity of galaxies. For 
instance, luminous galaxies have, on average, older stellar 



populations (see Figure [2ll as indicated by their luminosity- 
weighted age. In Paper I, we presented a definition of age 
which is independent of the mass or the luminosity through 
the St dimensionless parameter 



4 Lacerna, Padilla & Stasyszyn 



(tjMr)) 



(2) 



where for the i th galaxy, ti is its stellar age, (t{Mr)) is the 
median stellar age as a function of Mr, and at (Mr) is the 
dispersion around the median obtained from the 10 and 90 
percentiles of the distribution (error bars in Fig. [2|. Then, 
objects with positive (negative) values of St correspond to 
old (young) populations with respect to the median distri- 
bution of stellar ages of their luminosity. The scatter in the 
observational stellar age can be higher than the stellar age 
obtained in simulated galaxies such as the model described 
in Section [2^2] 

We also use _D„4000 in Equation © as a proxy for age 
considering its dependence on the absolute magnitude Mr 
(similar to Figure [2]). 

2.2 The numerical simulation 

We use synthetic galaxies from the SAG2 model by La- 
gos, Cora & Padilla (2008, see also Lagos, Padilla & Cora 
2009), who combine a cosmological A'^-body simulation of 
the concordance ACDM universe and a semi-analytic model 
of galaxy formation to follow the evolution of the barionic 
component of haloes accross cosmic history. The numerical 
simulation consists in a periodic box of 60 Mpc on a side 
that contains 256^ dark matter particles with a mass resolu- 
tion of ~10' h'^ M0. The cosmology used is ^Itot ~ 1, 
= 0.28 (with a baryon fraction of 0.16), = 0.72, as = 0.9, 
h — 0.72, and spectral index Ua = 1. Dark matter haloes are 
identified as self-bound structures that contain at least 10 
particles using a friends-of-friends (FOF) algorithm (Davis 
et al. 1985). Another algorithm (SUBFIND; Springel et al. 
2001) is applied to these groups in order to find substruc- 
tures with at least 10 particles. Typically, the most mas- 
sive subhalo is considered the main subhalo, while the other 
subhaloes are satellites within a FOF halo. The galaxy pop- 
ulation in the semi-analytic model is generated using the 
merger histories of dark matter haloes and their embedded 
subhaloes. The virial mass for galaxies corresponds to the 
virial mass of their respective DM substructures. 

For the simulated galaxies, we will use the mass- 
weighted stellar age in equation 0. Magnitudes in the 
SAG2 semi-analytic model are in the Johnson system. 
Therefore, we apply a correction to transform these mag- 
nitudes to the ugriz system to compare with SDSS galaxies 
using the equations of Jester et al. (2005) where, for objects 
with R-I < 1.15 and U-B < 0, 



r = V -OAQ{B- V)+i).ll 



and, for objects with just R— I < 1.15, 



= V-0.42(B-y) + 0.11. 



(3) 



(4) 



Table 3. Mean virial mass M^ir in units of Mq for old (5t > 
0.05) and young {6t < —0.05) galaxies of a given magnitud Mr 
using the SAG2 semi-analytic model (third column) and the semi- 
analytic model of De Lucia & Blaizot (2007) (fourth column). 
Given uncertainties the differences are not significant. 



Mr 






Ages 


log(M„i,)" 


log(M,,,)'' 


-19 < Mr 


< 


-18 


Old 


10.6 


10.7 








Young 


10.8 


10.8 


-20 < Mr 


< 


-19 


Old 


11.1 


11.0 








Young 


11.1 


11.1 


-21 < Mr 


< 


-20 


Old 


11.6 


11.5 








Young 


11.5 


11.5 


-22 < Mr 


< 


-21 


Old 


12.2 


12.2 








Young 


12.0 


11.9 



''SAG2. 

''De Lucia & Blaizot (2007). 



show a strong correlation with the inferred r-band absolute 
magnitude, which is strong for the virial mass, thus we are 
able to use Mr as a proxy o f mass, specially that of the 
substructure (jConrov et al. 2006 ; Si mha et al. 20121 ). 

One may think that old galaxies have larger mass-to- 
light ratios than young galaxies of equal luminosity. Table 
[3] shows the mean virial mass for old and young galaxies 
of equal luminosities using the SAG2 semi-analytic model 
and the semi-analytic model of De Lucia & Blaizot (2007)0 
Therefore, there is no a strong dependence regarding the 
virial mass when splitting the old and young galaxies for a 
given luminosity. Furthermore, in only one out of the four 
samples of our model, the halo mass is lower for the older 
population, which goes in the opposite direction to the com- 
mon sense interpretation of the cuts in age. This is due to 
the fact that we are looking at narrow ranges in luminosity. 

Figure |4] shows that the scatter between virial mass and 
the stellar mass is similar to that using Af,.- Therefore, there 
is little difference in using luminosity or stellar mass as a 
proxy for mass (at least that of the substructure). 



3 DEPENDENCE OF RELATIVE AGE ON 
LOCAL DENSITY 

To adapt the procedure of Paper I to galaxies, we need to 
understand the age-density aspect of the assembly bias in 
both simulations and observations. Figure [5] shows a relative 
stellar age map in our simulation, where the relative age 5t is 
calculated with respect to virial mass using equation (2) of 
Paper I. The colours represent different average values of 5t 
in grids conformed by cells of 2 Mpc on a side in a slice 5 
Mpc thick of our simulation. In general, regardless if the 
relative stellar age is calculated with respect to virial mass 



Figure [3] shows scatter plots of halo DM mass (virial mass 
as explained above and friends-of-friends in the left-hand 
and right-hand panels, respectively) vs. Mr - Both definitions 



This model ma de use of the Millennium Simulation 
JSpringel et al. 2005h . We find that this quantity is very simi- 
lar for galaxies of different age for a given magnitude using both 
models. 
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Figure 3. Scatter plots of mass versus the inferred absolute magnitude Mr for galaxies in the simulation. Left: as a function of virial 
mass, M^ir- There is a strong correlation between M^^^ and Mr, where more massive objects are brighter. Right: the mass is given by 
the friends-of-friends (FOF) approach, which is defined as the total mass of a FOF halo (i.e. a satellite galaxy has a My^j equal to the 
the central galaxy). This latter definition blurs the correlation seen in the left panel. 
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Figure 4. Scatter plots of virial mass versus the stellar mass for 
galaxies in the numerical simulation. As well as the luminosity, 
there is a strong correlation between M„ir and stellar mass. 



Figure 5. Age maps using the St parameter based on the stellar 
age with respect to virial mass in a slice of the numerical simula- 
tion. The colour bar indicates the different ranges in Sf 



o luminosity, values of 5t ^ 0.05 correspond to relatively 
old populations, while St ^ -0.05 to young populations. As 
can be seen, the latter appear less clustered than the old 
population even after, in this case, the mass dependence is 
removed using the definition of St. 

We quantify this behaviour using the Z)„4000 index. 
The distribution of D„4000 for our SDSS galaxies is shown 
in the left-hand panel of Figure [6] for different ranges of ab- 



solute magnitude. We see a clear bimodality for all of the 
magnitude bins plotted except for the brightest one, prob- 
ably because most of these objects are part of the 'red se- 
quence' (poor gas galaxies with low star formation rate). 
The vertical-dotted line shows the cut that divides the two 
populations; active galaxies (Dn4000 < 1.65) and quenched 
galaxies (D„4000 > 1.65). Note that it is very similar to the 
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condition used by Tinker, Wetzel & Conroy (2011), where 
quenched objets have Z)„4000 > 1.6. The right-panel of Fig. 
[6] shows the fraction of quenched SDSS galaxies as a func- 
tion of the large-scale environment. The latter is defined by 
counting the number of neighbour galaxies inside a sphere 
of 10 Mpc radius around each galaxy (see Muldrew et 
al. 2012 for a revision of different environment definitions in- 
cluding this one). We also restrict to neighbours above the 
magnitude limit for each sample. We see that the fraction of 
quenched objects increases to higher densities. Our results 
are qualitatively similar to those found by Tinker, Wetzel & 
Conroy (2011) using this index. 

The trend that old objects are located in regions of 
higher densities than young objects of equal mass or magni- 
tude is also readily seen in the two-point correlation function 
(see Section |4l for example), where the ones with the higher 
clustering at large scales are those located on higher density 
regions, which could be indicating the locations of higher 
initial density peaks. The correlation function, ^(r), has the 
advantage that it obtains an accurate characterization of the 
spatial distribution of objects from kpc scales out to Mpc 
scales by simply measuring the excess of pairs at a distance 
r with respect to a random distribution. Therefore, ^(r) is a 
useful tool to study the large-scale environment. The over- 
all results will support that older populations are located 
in higher densities compared to younger ones regardless the 
luminosity. 

High density environments play a key role in properties 
such as the formation age. Later in this work we will use 
this density-age connection to obtain a peak-height proxy. 
We will show that the crowding around objects helps us 
trace the assembly bias effect as well as the smooth density 
field approach introduced in Paper I. 



4 ASSEMBLY BIAS IN SDSS GALAXIES 

In this section, we will test whether SDSS galaxies show an 
assembly type bias by using the two-point correlation func- 
tion for the centres and tracers in the samples of Tables [T] 
and [21 Due to the redshift space distortions, we have to cal- 
culate first the cross-correlation in two coordinates ^(cr, vr), 
where cr is perpendicular and tt is parallel to the line of sight. 
Figure [7| shows the resulting ^(cr, tt) for the old and young 
galaxies in the magnitude range —21 < Mr < —20. As can 
be seen there is a subtle difference between both populations 
due to their slightly offset clustering amplitudes (c.f. Figure 
[8|, but also possibly due to different finger-of-god and infall 
effects. Given that the differences in infall are too subtle, we 
will not use the these profiles as estimators of the assembly 
bias in constrast to Paper 10 

I use the jackknife technique to estimate the variance 
error in the calculation of ^(cr, tt). In particular the original 
data is divided in 20 subsamples. The correlation function 
is calculated by discarding one of the subsamples at a time, 
and then it is compared with ^(ct, tt) obtained for the whole 
sample in order to build the error estimator. 

We integrate over the tt component to obtain the pro- 
jected correlation function i^(cr), 

^ Except in Section |6] where we reproduce the method used in 
Lacerna & Padilla (2011). 



Ojia) = 2 f "^''^ aa,n)dn, (5) 



where -Kmin = 0.1 h~^Mpc and iVmax ~ 30 /i~^Mpc. 
This can be inverted to obtain the real-space cross- 
correlation function using the relation described by 
Saunders, Rowan-Robinson & Lawrence (1992, see also Paz, 
Stasyszyn & Padilla 2008), 



(6) 

The panels of Figure [8] show ^(r) for old and young ob- 
jects, selected using ranges in 5t, for samples of different lu- 
minosities. Error bars are calculated taking into account the 
error estimate for ^(cr, tt) and using equations ([5} and ([6]). 
We systematically find that the old galaxies show a higher 
clustering relative to the young population at large scales 
{r > 1 Mpc) for different ranges in Mr- This differ- 
ence typically increases toward brighter objects from a 25 
per cent to a 45 per cent effect. Therefore, SDSS galaxies of 
equal luminosity show a significant dependence of clustering 
amplitude with the age since the differences are over a 20 per 
cent effect. As can be seen from Table [S] the mass-to-light 
ratio M/L is not higher for old galaxies at fixed luminos- 
ity in the simulation, so that this dependence of clustering 
amplitude with age is the assembly bias effect. 

This result is in agreement with Wang et al. (2008), 
Zapata et al. (2009) and Cooper et al. (2010) who find an 
assembly-type bias in galaxies. Other works, however, men- 
tion that the environmental dependence in galaxies is not 
important at large-scales. For example. Tinker, Wetzel & 
Conroy (2011) found that the fraction of quenched galaxies 
increases with the large-scale environment, but this arises 
just from the dependence of the halo mass on the environ- 
ment. They use the -D„4000 parameter to classify a quenched 
galaxy (Z)„4000 > 1.6). We find that when using this index 
to separate active/young and quenched/old objects, the dif- 
ference in clustering amplitude at large scales due to assem- 
bly bias is not clear. The left-panel of Figure [9] shows the 
ratio of the correlation function of old galaxies to the total 
sample of a given magnitude and the same ratio but using 
young galaxies, where the determination of age is based on 
D„4000 in Equation ((2)1. The difference in the clustering am- 
plitude between the old and young population is very noisy 
using this estimator for the age, in constrast to the results 
obtained using the luminosity-weighted stellar age shown in 
the right panel (see also lower boxes of Fig. [S]). Therefore, 
the _D„4000 index is less adequate to show clearly the en- 
vironmental dependence at large scales, even when we have 
12 per cent more galaxies with this index than with the stel- 
lar age (c.f. Section [2.ip . Probably, this is due to the fact 
that the Z)„4000 index is related with star formation by re- 
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Figure 6. Left: distribution of the D„4000 index for SDSS galaxies, normalized by 10,000 of them, for four bins in magnitude. We see a 
clear bimodality for all except the brightest one. The vertical-dotted line is our cut to define quenched galaxies, Z)„4000 > 1.65. Right: 
fraction of quenched SDSS galaxies as a function of the environment for different magnitude bins. 
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Figure 7. Cross-correlation functions in the directions parallel and perpendicular to the line of sight for old (left panel) and young (right 
panel) SDSS galaxies in the magnitude range —21 < M,- < —20. The colour in the figure key indicates the amplitude of the correlation 
function. The three contour levels, from large to small separations, correspond to ^((T, tt) = 0.5, 1.0 and 5.0, respectively. 



cent bursts in contrast to the luminosity-weighted age which 
covers an important range of star formation histories|f| 



4.1 Cross-correlation functions for the simulated 
galaxies 



^ Gallazzi et al. (2005) includes the Dn4000 index among other 
spectral features to constrain the luminosity-weighted age. 



We use the z = model galaxies and then select four samples 
of centres and tracers in the same ranges of Mr indicated in 
Tables [T] and [2l respectively. Figure [TOl shows the results of 
the different cross-correlation functions. Error bars are cal- 
culated using the jackknife method, which has been shown 
as a robust error estimator for cosmological samples. In gen- 
eral, this method consists in dividing the sample in parts. 
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Figure 8. Cross-correlation functions in real space for SDSS galaxies (see Section |4] for details). The bins in magnitude Mr are 
indicated in each panel. Old and young galaxies are shown in red circles and blue squares, respectively. Error bars were calculated using 
the jackknife method. The lower box in each panel corresponds to the ratio between the correlation function of the old population and 
the total population of the selected sample and, also, between the young population and the total one (dot-dashed red and dotted blue 
lines, respectively). The error of the ratio between the ^(r) of the old and young objects is shown as a shaded region around the unit 
ratio. 



The correlation function is calculated by discarding one of 
the subsamples at a time, £,i{r), and then it is compared 
with the correlation function obtained for the whole sample 
^(r) as follows 

1 ^ 

1=1 

where A'^ is the total number of subsamples. In particular, 
samples selected from the numerical simulation are divided 
in ten parts, A'^ = 10 (for samples selected from SDSS data. 



A'^ — 20). As can be seen from these panels, old galaxies (red 
circles) are more strongly clustered than the young objects 
(blue squares), with average differences of up to a factor of 
two in the clustering amplitude at large scales, r > 1 
Mpc. 

In order to compare the results from real and simulated 
galaxies. Figure [TTI shows the ratio ^(r) of the old galaxies 
to that of the young galaxies from the simulation and SDSS- 
DR4 (solid and dash lines, respectively) . Both ratios are sim- 
ilar for distances beyond 3 Mpc, especially for brighter 
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Figure 9. Ratios between the correlation function of the old SDSS galaxies and the total population of each selected sample (dot-dashed 
red lines) and, also, between the young SDSS galaxies and the total one (dotted blue lines), where the definition of age for the galaxies 
is based on the D„4000 index (left panel) and the luminosity- weighted stellar age (right panel). Errors are not shown for clarity. 



magnitudes. The clustering of old objects is typically thirty 
per cent higher than that of young galaxies. At lower scales 
the ratio is higher for the semi-analytic galaxies, while it re- 
mains constant for the SDSS galaxies. This difference may 
be explained, for instance, by the simplified method of circu- 
lar orbits in the simulation that could affect the profiles on 
the one-halo term (Lagos, Cora & Padilla 2008). Note that 
Guo et al. (2011) found that the projected autocorrelation 
functions for their semi-analytic galaxies are higher than 
for observations at scales r < 1 Mpc. They suggest what 
this is due to the fact that their assumed fiuctuation am- 
plitude, (Tg = 0.9, the same value used in this work, is too 
high. The latest resuhs of WMAP7 (Komat su et al. 20111 ) 
estimate a% = 0.809 ± 0.024. Furthermore, Figure [TTI shows 
the ratios of two correlation functions, pointing to a stronger 
overprediction for old galaxies which reside in denser regions, 
which is consistent with Guo et al. (2011), who also find that 
the simulation overpredicts the autocorrelation functions for 
passive galaxies, while underpredicts it for active galaxies. 



5 OVERDENSITY PEAK HEIGHT PROXY 
USING GALAXY LUMINOSITIES 

In Paper I, we presented an approach to trace the assembly 
bias effect. This consisted in measuring the total mass in- 
side spheres of different radii around semi-analytic galaxies, 
which in some cases could be larger than the virial mass of 
the host dark matter halo. Using two free parameters to in- 
troduce a dependence of this radius on mass and age, this 
model defined a new overdensity peak height for which the 
large-scale clustering of objects of a given mass did not de- 
pend on the age. We then analyzed the changes of this peak 
height proxy, compared to virial masses. We typically found 
that low-mass, old galaxies have larger peak heights than 



young galaxies of equal virial mass, which implies that the 
environment is more dense for old objects. They are prefer- 
entially surrounded by massive neighbour haloes, which is 
in agreement with other works that mention old, low-mass 
haloes may suffer a truncated growth due to massive neigh- 
bour haloes (Wang, Mo & Jing 2007; Dalai et al. 2008; Hahn 
et al. 2009). Therefore, the parametrization of the radius for 
the sphere to measure the environment is a useful tool to 
understand what is behind the assembly bias. 

However, it is difficult to obtain the underlying mass 
information for real galaxies. For this reason, the aim of 
this section is to design a similar procedure, but using the 
luminosity of galaxies. We take advantage of the fact that 
galaxies of equal virial mass have similar luminosities (c.f. 
Section [2.2p . The idea behind our work is not to minimize 
the bias, but it is to use this method to understand the 
physical features that produce the assembly bias. 

Instead of using spheres as in Paper I, we measure the 
total luminosity (which correlates with total substructure 
mass) inside a cylinder centred on each galaxy along the 
line-of-sight to take into account the limitations imposed by 
the redshift space distortions. The radius of the cylinder for 
each galaxy in Mpc units is parameterized as 

r = a 5t -I- fe (M. - Mr) , (8) 

wher e the charac teristic magnitude is A/* = —20.5 
(Zch avi et al. 20051 ). The free parameters are a and h. The 
length of the cylinder is Aw = ± 500 km/s centred in each 
galaxyQ We add the luminosity L of these objects to ob- 

^ The choice of cylinder length ± 500 km/s corresponds to a total 
length of 5 /i~^Mpc in the z-coordinate of the simulation. 
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Figure 10. Same as Figure[8]but for the galaxies in the simulation. Error bars were calculated using the jackknife method (see equation 
by dividing the sample in ten parts. The error of the ratio between the ^(r) of the old and young objects is shown as a shaded region 
around the unit ratio in each lower box. 



tain the total luminosity in this volume using the relation 
Mr — M« = —2.5 log(I//L,). When the radius is negative 
the new magnitude of the central object remains unchanged, 
i.e. Mr = Mr- If this radius is greater than 3 Mpc, the ob- 
ject will not be considered for further analysis (for the best- 
fitting set of parameters, none of the galaxies exceed this 
limit in radius). 

Notice that this cylinder will include neighbour galaxies 
and therefore this will be related to the crowding of the en- 
vironment of the galaxy, a measure of the local density, but 
with an adaptive volume size depending on the galaxy age 
and luminosity. This procedure to trace the assembly bias 
will use correlation functions of relatively old and young 



galaxies to fit the parameters of the cylinder radius, but 
this parametrization will only be said to correlate with the 
correction of Paper I (missing peak mass) if the excess of 
crowding in galaxies correlates with the excess of mass found 
in haloes reported in our previous paper. In order to demon- 
strate that this is the case, we first need to find the parame- 
ters that trace the assembly bias using this measurement of 
crowding, and then to check whether the relation between 
crowding and missing peak mass is present. In contrast to 
Paper I, we do not use the infall velocity profile because 
the velocity inferred from redshift space distortions is noisy 
compared to the velocity from numerical simulations. 
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Figure 11. Ratio between tlie correlation functions of the old 
population and that of the young population for the different 
ranges in luminosity indicated in each panel. Dashed lines (open 
squares) represent the real-space results of the SDSS galaxies as 
shown in Fig. |8l Solid lines (filled circles) correspond to those 
from the simulated galaxies (see Fig. IIOI I. Dotted horizontal lines 
show the unit ratio (no assembly bias). 

5.1 Simulated galaxies 

Our aim is to affect the model with the observational prob- 
lems for which we define the z-sxis of the simulation as the 
line-of-sight and turn the z-coordinate into a recession veloc- 
ity, v'z = lOQz + Vz, where Vz is the velocity component in the 
2-coordinate. When measuring the luminosity in each cylin- 
der in the numerical simulation, we include all the objects 
with stellar masses higher than 10^ M© and magnitude 
Mr brighter than the limit given for each sample in Table [2] 

We calculate the cross-correlation functions between 
samples of a given new magnitude and tracer galaxies 
that satisfy M,- < —18. We also select subsamples of old and 
young objects using the proxy for the age, 5t, as measured 
in Section [2. 1.1 1 but using M,'.. 

In order to find the best-fitting parameters that do not 
show an age dependence of the clustering, the reduced 
for the correlation function statistics is defined as 




and is calculated within the range 0.8 «; r/h~'^ Mpc 10, 
mostly in the two-halo term; Nyoung is the number of neigh- 
bours for young galaxies, whereas Nou is the same quan- 
tity for old objects. The number of neighbours is defined as 
N{r) = {Nt{r))^{r) +(Nt{r)), where {Nt{r)) is the mean 
number of tracers (see Paper I for details). The error is 
^Ar(r) = ^Afyoung + ^Afoid' calculated as the relative er- 
ror of the number of neighbours. The symbol Udof denotes 



Figure 12. Solid curves show the ratio between the correlation 
functions of the old and young populations of semi-analytic galax- 
ies for different ranges in magnitude indicated in each panel 
for the best-fitting values a = 0.75 and b = —0.36. Dotted horizon- 
tal lines show the unit ratio. For comparison, solid circles (dashed 
lines) correspond to the ratio shown in Fig. Illl for the simulated 
galaxies using Mr. Note that this comparison is just for reference 
because the adopted magnitudes are different. 

the number of degrees of freedom. The value x|(r) is the av- 
erage over four magnitude bins selected between the 10 and 
70 percentiles of the Mr distribution. This particular defini- 
tion of goodness of fit avoids selecting parameters favoured 
by large uncertainties that can induce spuriously good fits. 

The best-fitting parameters corresponding to the high- 
est likelihood in the full parameter space are a = 0.75 and 
b = -0.36 (x^ ~ 0.51). Figure [m shows the ratios of the 
correlation functions of old and young objects using these 
values for different magnitude ranges. The assembly bias is 
practically absent in the numerical simulation using this for- 
malism (compare with solid lines in Figure [TT|) . The average 
differences in clustering amplitude are smaller than a 10 per 
cent for each range in M^. 

Note that we do not include objects with fainter mag- 
nitudes than the limit given by the central galaxy in the 
cylinder. Figure [131 shows the true total magnitude if we also 
include these neighbour galaxies vs. Mr- There is a strong 
relation between both magnitudes with a small scatter that 
is more important for fainter objects. Furthermore, if we use 
all the neighbour galaxies, ignoring the restriction of the flux 
limited catalogues, the best-fitting parameters are a = 0.5 
and b — —0.36 (x^ ~ 1-01); the parameter a changes slightly 
to a lower value, while the parameter b is the same. Figure 
1141 shows the likelihood functions for the no-flux limit ap- 
proach (dashed lines) and those of the magnitude Mr (solid 
lines) . Notice that the maximum likelihood in this flgure oc- 
curs for parameter values that do not necessarily coincide 
with those of the maximum likelihood in the full parameter 
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Figure 13. Magnitude tliat accounts for the biased lumi- 
nosities around central simulated galaxies in cylinders of radius 
r = aSt + b{Mf — Mr), with free parameters a = 0.75 and 
b = —0.36 (see Section l5.ll for details), as a function of the true 
total magnitude in the r-band if the contribution of all the semi- 
analytic galaxies located in these cylinders is included. We find a 
strong relation between both magnitudes with a small scatter that 
is more important for fainter objects. These central semi-analytic 
galaxies are in the range —22 < Mr < —18. 

space. We find that both approaches for the luminosity in 
the r-band obtain similar maximum likelihood values consis- 
tent within a la confidence (indicated by the intersections 
of the parameter distributions with the dotted line). This 
justifies our use of as a good estimator of the total lu- 
minosity around galaxies in the range —22 < Mr < —18 
with the best-fitting values a — 0.75 and b — —0.36, avoid- 
ing the complications introduced by the flux limit nature of 
the observational catalogues. 

In the next section we show a similar approach to rede- 
fine the overdensity peak height using luminosities applied 
to SDSS galaxies. 

5.2 SDSS galaxies 

Figure [15] illustrates the volume-limited samples used in this 
section in order to avoid obervational bias, where Mr is the 
redefined luminosity. We have hundreds of volume-limited 
samples such as Figure [15] given by the different combina- 
tions of the parameters a and b, thus this was done au- 
tomatically in order to optimize the number of objects in 
selected subsamples in luminosity. The procedure to define 
these subsamples is as follows. For each cut in M'., we count 
the number of galaxies in four ranges of redshift indicated 
in the last column of Table [T] We then select the range that 
is most populated that defines faint and bright limits in Mr 
and a maximum redshift, zum- All the galaxies located be- 
tween the magnitude limits of this range with z ^ zum are 
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Figure 14. Marginalized likelihood functions for the free param- 
eters a and b (top and botton panels, respectively) using the mag- 
nitudes of semi-analytic galaxies described in Fig. I13l and Section 
15.11 the true total magnitude in the r-band (dashed lines) and 
Mr (solid lines). Both approaches for the luminosity of cylinders 
of radius r (Equation js} obtain similar maximum values in the 
range of la (intersection of distributions with the horizontal dot- 
ted line). We also include the results for the magnitude A/^ for 
SDSS galaxies described in Section 15.21 (dot-dashed lines). 



selected as members of a centre sample, and all the galaxies 
brighter than the faint-end limit of this range with z ^ zum 
are selected as a tracer sample in order to perform cross- 
correlation functions. For example, the samples shown in 
Figure [151 correspond to the parameters a = 0.5, b = —0.36. 
In this case, for the first cut in magnitude, we select the 
galaxies with -19.92 < < -19 and z 0.078 (red 
dots) as the first centre sample. Its respective tracer sam- 
ple are the galaxies that satisfy Mr ^ — 19 and z ^ 0.078. 
For the last cut in magnitude, we select the centre sample 
(cyan dots) using the conditions —23.39 ^ Ml ^ —22.23 and 
z ^ 0.078. Galaxies for the respective tracer sample satisfy 
M'r ^ -22.23 and z < 0.078. 

We again select subsamples in 5t using the distribution 
of the luminosity- weighted stellar age as a function of Ml. 
The correlation functions are obtained in redshift space as 
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Figure 15. Magnitude A/^ of SDSS galaxies as a function of red- 
siiift for tlie set of parameters a = 0.5, b = —0.36. Four volume- 
limited samples of centres and tracers are chosen to perform cross- 
correlation functions without observational bias. Further details 
of the selection of centres (in colour regions) and tracers in Section 

shown in Figure 1161 We estimate their real space counter- 
parts as described in Section ID 

The number of neighbour objects in equation ((9]) using 
SDSS samples is at least an order of magnitude lower than 
that in the simulation, so that the statistic to perform the 
reduced "x^ is affected by significant noise. The number of 
neighbours is typically smaller than dozens of SDSS objects, 
whereas this number is greater than a hundred for simulated 
objects. This is due to the way the number of neighbours is 
calculated in equation Q, which is proportional to the mean 
number of tracers at a distance r, (A'"t(r)) oc V {r) /Vtot, 
where the total volume of the simulation is Vtot ~ 10^ Mpc'^ 
and, for example, the total volume covered by the SDSS 
sample at z = 0.175 is Vtot ~ 10^ Mpc'^. For this reason, the 
reduced in this case (M^ for SDSS galaxies) is defined as 

where £,oid{r) and (,you7ig{r) is the result for old and young 
SDSS galaxies, respectively. The error is = A|^^^ + 
^iyoung ' ^hcrB thc first term is the jackknife error for (,oid{r) 
and the second for (,young{r). This statistics is estimated 
within the range 0.7 ^ r/h~^ Mpc ^ 10 and averaged over 
the four magnitude bins selected in M^. 

For the SDSS galaxies, the best-fitting values are a = 1, 
b = —0.36. The marginalized probability distributions for a 
and b are shown as dot-dashed lines in Figure 1141 and as 
can be seen there is a good agreement with the distribu- 
tions obtained from our simulation. It is remarkable that the 
parameter a has a value only slightly greater compared to 
that obtained for the galaxies in the simulation (o = 0.75), 
which lies within the uncertainties of this parameter, and 
the value of 6 is equal in both cases. The correlation func- 



tion ratios using these values are shown in Figure [TTl Again, 
the assembly bias is almost not present using the proxy for 
the overdensity peak height at r > 1 Mpc (average dif- 
ferences in clustering amplitude are smaller than a 10 per 
cent for each range in magnitude M^), thus confirming that 
our approach is able to define an initial peak-height proxy 
using the crowding of the region where galaxies lie, in a sim- 
ilar manner to that implemented in Paper I, and that SDSS 
galaxies show a similar behaviour to ACDM semi-analytic 
galaxies since their best-fitting sets of parameters are com- 
parable. 

In this context, as mentioned above. Figure [T?] shows 
the marginalized likelihood distributions for the free param- 
eters a and b of equation (|8} for three cases: the new magni- 
tude M'. for simulated galaxies, the true total magnitud in 
the r-band using also simulated galaxies and for SDSS 
galaxies. By estimating a weighted average for the values in 
the range of la for the three distributions, it is obtained 

a = 0.76 ±0.31 
6= -0.39 ±0.18 . 

The best estimates for the a parameter are always positive, 
whereas those for the b parameter are negative when defining 
an overdensity peak height proxy using luminosities. The 
next section will discuss the properties of this proxy of peaks 
when using galaxy luminosities. 



6 PEAKS DEFINED USING LUMINOSITY 

The redefinition of the overdensity peak height using galaxy 
luminosities is able to trace the assembly bias (Section [5]). 
Table [3] shows the median radius of equation ([S| in physi- 
cal units of kpc for the four different bins in Mr using 
the simulated galaxies. The results are shown for two sets of 
best-fitting values, the case (a = 0.75, 6 = —0.36) and 
the true total magnitude in the r-band (a = 0.5, b = —0.36). 
The radius of the cylinder for all the objects decreases to 
higher luminosities with very similar sizes for both sets of 
parameters. This is in agreement to what was discussed in 
Paper I, where the radius decreases to more massive objects, 
due to the correlation between luminosity and virial mass. 
Bear in mind that the radius can be negative, r < 0, since 
both the first and second term of equation (|8]) can be neg- 
ative, which implies — Mr- The results are also split 
according to the classification of old and young populations 
after performing the redefinition of luminosity for these sim- 
ulated galaxies. As well as Paper I, old objects have higher 
radii than young objects (of equal luminosity in this case), 
thus suggesting stronger enviromental dependences for old 
populations. The median sizes for both sets of parameters 
are similar for young galaxies, with differences smaller than 
37 h^^ kpc, while these differences for old galaxies range 
from 8 h^^ kpc out to 200 kpc toward brighter magni- 
tudes. 

Table [5] shows the median radius of equation (|8} using 
the SDSS galaxies. In general, there is a good agreement of 
the radius sizes with respect to the case (see Table |4]) , 
with differences among them smaller than 22 kpc for 
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Figure 16. Same as Figure [7] also for SDSS galaxies, but using the best-fitting values a = 1, b = —0.36 in equation {Sjl for galaxies 
the magnitude range —22.23 ^ ^ —21.08. There is no important differences between old and young populations. 



Table 4. Median radii in physical units {h~^ kpc) from equation ^ for simulated galaxies (see details in Section 15.111 . as given by the 
best-fitting parameters in the case (a = 0.75, b = —0.36) and the true total magnitude in the r-band (a = 0.5, b = —0.36). The results 
are shown for all objects and, also, split among the old and young populations of simulated galaxies after performing the redefinition of 
luminosity. The radius r < implies = Mr- The ranges in magnitude arc those shown in Table [T] 



Best-fitting parameters Ages (''/^ "'kpc) ('"/^ ""^kpc) (''/'* "'kpc) (^/^ ""^kpc) 

Mr = (-19, -18) Mr = (-20, -19) Mr = (-21, -20) Mr = (-22, -21) 



a = 0.75, b = -0.36 


All 


716.4 


380.6 


29.1 


< 




Old 


910.7 


723.1 


523.4 


206.6 




Young 


582.2 


< 


< 


< 


a = 0.50, b = -0.36 


All 


719.9 


385.0 


28.4 


< 




Old 


902.8 


596.7 


340.1 


6.5 




Young 


593.8 


36.6 


< 


< 



all the galaxies, 88 kpc for the old galaxies, and 24 
kpc for the young galaxies. This means that the best-fitting 
set of parameters that make the assembly bias practically 
absent for both simulated and real galaxies are truly similar 
when using luminosity as a proxy of peak height. 

The method followed to redefine the luminosity of a 
galaxy could be compared to adding up the mass from the 
smooth density field beyond the virial radius. We compare 
the resulting Af^ luminosity function with that of the total 
luminosity in haloes in the simulation to help interpret the 
changes introduced by our approach; this is equivalent to 
our procedure in Paper I where we compared the mass func- 
tions for the original virial mass, and that resulting from 
our redefinition of peak height, although in this case the 
luminosity function does not have a theoretical prediction. 
Figure [181 shows differences in the luminosity function that 
results from the redefinition of luminosity Mr for central 
galaxies (solid circles) and that from adding the luminosity 
in the r-band of all the galaxies with Mr < —18 in the same 
friends-of-friends (FOF) halo (open triangles) as an approx- 



imation of the total luminosity within the peak. The latter 
is defined for FOF haloes where the magnitude of their cen- 
tral semi-analytic galaxy is Mr > —22, otherwise the mag- 
nitude of FOF haloes is simply that of their central galaxy. 
As our formalism adds luminosities of galaxies embedded in 
haloes, one could have expected the number of these objects 
with magnitudes Mr to be similar to that of all the galaxies 
in FOF haloes, but the difference between the luminosity 
functions measured with Air and Mr is considerable. In this 
section we will study the origin of this discrepancy. 

In Paper I we found that the redefinition based on the 
mass density field does not change the halo mass function 
at M > 10'^ Mq (see their Fig. 8) when compared to 
that of Sheth, Mo, & Tormen (2001, SMT) or that obtained 
from the virial mass, which is in agreement with the result 
found by Gao, Springel & White (2005) where the assem- 
bly bias with respect to formation time is not important for 
the high mass regime. The reason for the change found in 
the low-mass regime is that the peak for an old, low-mass 
object adds more haloes and mass than for a young object 
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Table 5. Median radii in physical units {h~^ kpc) from equation for SDSS galaxies (see details in Section 15.211 . as given by the 
best-fitting parameters a = 1, ft = —0.36. The results are shown for all objects and, also, split among the old and young populations of 
observed galaxies after performing the redefinition of luminosity. The radius r < implies Af/. = ■ The ranges in magnitude are those 
shown in Tables [Tl and [4l 



Best-fitting parameters Ages ix/h ''"kpc) ('"/^ """kpc) (j'/h ■'"kpc) ij'/h """kpc) 

Mr = (-19, -18) Mr = (-20, -19) Mr = (-21, -20) Mr = (-22, -21) 
a = 1.0, b = -0.36 All 694.6 358.3 28.2 < 

Old 851.8 726.1 535.7 294.5 

Young 606.1 < < < 
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Figure 17. Same as Fig. 1121 but using the best-fitting values 
for the SDSS galaxies a = 1 and b = —0.36 (solid lines). For 
comparison, open squares (dashed lines) correspond to the ratio 
shown in Fig. II II for the SDSS galaxies using Mr - Note that this 
comparison is just for reference because the adopted magnitudes 
are different. 

of equal mass. For the following test, instead of considering 
the mass from dark matter (DM) particles inside a sphere, 
which corresponds to a continuous field, we use the mass of 
dark matter haloes in spheres, which is a discrete formal- 
ism similar to using the luminosity of individual galaxies. 
Bear in mind that both estimators measure the neighbours 
enclosed within the sphere calculating the crowding of the 
region. In the top and middle panels of Figure [T9l we show 
the correlation function and the velocity infall profile for old 
and young objects of a given mass for the best-fitting pa- 
rameters a = 0.08 and h — —0.32 in Equation (14) of Paper 
I, resulting from using this discrete approach with neigh- 
bour DM haloes. Compared to the best-fit parameters of 
Paper I (a = 0.2 and b = —0.02), these are significantly dif- 
ferent. The difference between the approach for haloes and 
that used for a smooth density field is evident when looking 
at the mass function, shown as open triangles in the bot- 
tom panel, which is completely different than that using the 



Figure 18. Luminosity function of Mr magnitudes of semi- 
analytic galaxies using the parameters a = 0.75 and b = —0.36 
(solid circles). Open triangles correspond to the total r-band lu- 
minosity function of FOF haloes given by the contribution from 
their simulated galaxies with Mr < —18. 

virial mass (green squares) as well as the predicted mass 
functions from the extended Press-Schechter model (EPS) 
and from the SMT model shown as long-dashed and dot- 
dashed lines, respectively. We also compare it with a set of 
parameters with good-fitting results to trace the assembly 
bias using the continuum formalism (a = 0.2 and b = —0.02, 
open circles). The behaviour of the latter mass function is 
in agreement with the virial mass, and the EPS and SMT 
models. 

The discrete formalism actually measures the crowd- 
ing around objects which constitutes a different proxy of 
the peak height than the total mass. Figure |20] compares 
the result obtained with the best-fitting set of values a = 
0.75, b — —0.36 for the luminosity of individual simulated 
galaxies and that obtained in Paper I, which considers the 
mass of DM particles. There is a clear correlation between 
both methods, showing that the same phenomenon is being 
described. We calculate a correlation coefficient using the 
Pearson's product-moment coefficient. 
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Figure 19. Results using the discrete method for the mass in 
the simulation. The correlaton functions (top) and the velocity 
profiles (middle) for old and young simulated galaxies (St > 0.05 
and 5t < —0.05, respectively) are similar at large-scales using the 
best-fitting parameters a = 0.08, b = —0.32. However, these val- 
ues do not produce an adequate mass function (triangles, bottom 
panel) compared with that from the virial mass (squares), the 
EPS (long-dashed line) and the SMT (dot-dashed line) models. 



Figure 20. Estimates of the overdensity peak height proxy us- 
ing the approach followed in Paper I that considers the mass of 
dark matter particles (best-fitting set of parameters a = 0.2 and 
b = —0.02, X-axis), and the luminosity of individual simulated 
galaxies (a = 0.75 and 6 = —0.36, y-axis). The mass is in units 
of the virial mass of the DM substructure that hosts each galaxy. 
The luminosity is shown as a difference between the original mag- 
nitude Mr and M^, which is described in Section |5. II Solid circles 
are the median and the error bars correspond to the 10 and 90 
percentiles of the distribution. 



c = 



^(Ami - Am) (AM - AM) 



(11) 



, ^(Am, - Am)2 ^(AM - AM)^ 

\ i=l i=l 



where Am — Mr — M'r is the difference between the original 
magnitude Mr and the new one M'r (y-axis of Fig. I20|) . Am 
is its average using the A'^ objects, AM = log(M/M„ir) is 
the new mass in units of the original virial mass (x-axis 
of Fig. [20]), and AM is its average. We obtain C = 0.73, 
which means that there is a strong correlation between both 
formalisms. 

We conclude that although our method to measure the 
total luminosity will not reproduce the luminosity function, 
the reason for this is only the use of discrete neighbours. If we 
were able to measure the distribution of mass around galax- 
ies and turn it into luminosity, we would obtain a more phys- 
ical luminosity function, as we did in the case of the mass. 
This result implies that the phenomena behind the assem- 
bly bias, namely, the truncation of mass infall in haloes of 
low mass embedded in high density regions, can be detected 
with the new proxy for peak height that can be applied to 
real galaxy surveys. 



7 CONCLUSIONS 

We have shown that the assembly bias effect, namely, the 
difference in the clustering amplitude at large scales of pop- 
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ulations of equal mass or luminosity but different age, ex- 
ists for both semi-analytic galaxies and real galaxies from a 
SDSS catalogue. This is a relevant issue that could affect the 
ability of the next generation of galaxy surveys to infer accu- 
rate cosmological parameters. Using the two-point correla- 
tion function, the clustering strength of old SDSS galaxies is 
from a 25 per cent to a 45 per cent higher than that of young 
SDSS galaxies of same magnitude at scales r > 1 Mpc. 
The estimator of age is based on the mass- and luminosity- 
weighted stellar ages for the synthetic and SDSS galaxies, 
respectively. In this context, we show that the _D„4000 in- 
dex is a less adequate parameter to separate active/young 
and quenched/old galaxies for assembly bias studies using 
correlation functions at large scales. 

Lacerna & Padilla (2011, Paper I) presented an over- 
density peak height proxy with the aim to understand the 
assembly bias effect. This new definition was proposed as a 
better alternative than the virial mass. In this work we adapt 
this model including observational issues using galaxy lumi- 
nosities, taking advantage of their correlation with substruc- 
ture mass. As the SDSS is limited in flux, and as the tracers 
of mass are the galaxies themselves, the method is modified 
to use the crowding of the environment around galaxies in 
selected volume-limited samples, instead of searching for a 
correction to the virial mass as was done in Paper I. We 
measure the total light in cylinders centred in each galaxy, 
thus obtaining an estimate of the crowding around galax- 
ies, that traces the asembly bias. The radius of this cylinder 
is parameterized as a function of stellar age and luminosity. 
The best-fitting sets of parameters in the simulation and ob- 
servations are very similar (a = 0.75, b = —0.36 and a — 1, 
b = —0.36, respectively). In both cases the asembly bias is 
practically absent using this approach. The average differ- 
ences in clustering amplitude between old and young pop- 
ulations for both semi-analytic and observational galaxies 
are smaller than a 10 percent for each range in magnitude 
M^. By estimating a weighted average for the values in the 
range of la for semi-analytic and SDSS galaxies, it is ob- 
tained that the set of best-fitting parameters for the radius 
of the cylinder is a = 0.76 ± 0.31, b = -0.39 ± 0.18. 

This latter model, which constitutes a discrete formal- 
ism when counting galaxies within a cylinder or haloes 
within spheres, does not reproduce the original luminosity 
or mass functions since it actually measures the crowding 
around galaxies. However, it is correlated with the smooth 
density field approach of Paper I. Therefore, both formalisms 
are associated with the fact that the mass of peaks that did 
not collapse on to haloes will not be an appropiate proxy 
for the peak height. It is likely that low-mass haloes in high 
density environments show their growth truncated by the 
gravitational effects of their (massive) neighbours. Our re- 
sults indicate that this also affects the growth of the stellar 
mass in galaxies in addition to the growth of dark matter 
haloes. This could induce a misclassification of galaxies in 
theoretical models to populate haloes, such as HOD, because 
using virial mass instead of proper peak height. 

These results will be of high importance for the next 
generation of galaxy surveys, such as LSST. The huge 
amount of information available will allow us to measure the 
clustering amplitude of galaxies with significant statistics at 
Mpc scales. Therefore, as we show in this paper, it will be 
absolutely necessary to model the effect of the environment 



on the two-halo regime in order to understand its role in the 
formation and evolution of haloes and galaxies. Theoretical 
models will have to include this phenomenon in order to re- 
produce the observational data with high accuracy in the 
new era of precision cosmology. 
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